Write chunks with negative zero values and a zero fill value #3216

bojidar-bg · 2025-07-08T11:44:23Z

Fixes #3144.

Using np.any(self._data) was inspired by how Zarr v2 checks for equality with a falsey fill value.

TODO:

Add unit tests and/or doctests in docstrings
Add docstrings and API docs for any new/modified user-facing classes and functions
New/modified features documented in docs/user-guide/*.rst
Changes documented as a new file in changes/
GitHub Actions have all passed
Test coverage is 100% (Codecov passes)

tests/test_array.py

bojidar-bg · 2025-07-08T11:50:11Z

Oh, oops, thanks 😅

Fixes zarr-developers#3144

changes/3144.bugfix.rst

src/zarr/core/buffer/core.py

d-v-b · 2025-07-11T17:58:51Z

this test failure seems significant: https://github.com/zarr-developers/zarr-python/actions/runs/16172926021/job/45650861381?pr=3216#step:8:420

dcherian · 2025-07-11T18:01:00Z

this test failure seems significant

Yes looks like this approach doesn't work for complex number types

d-v-b · 2025-07-11T18:03:40Z

what if we view the array as raw bytes (should be cheap) and compare the raw bytes?

>>> import numpy as np
>>> np.array([0.0]) == np.array([-0.0])
array([ True])
>>> np.array([0.0]).view('V') == np.array([-0.0]).view('V')
array([False])

bojidar-bg · 2025-07-11T18:22:14Z

I wonder if that would somehow break with floating point subnormal-s and the like. Will have to experiment 🤔

Co-authored-by: Davis Bennett <davis.v.bennett@gmail.com>

bojidar-bg · 2025-08-01T13:35:28Z

Took me a bit, but finally got around to it. Subnormals are fine, and behave as expected; the only difference between Python's float equality and bitwise float equality is that signed zeroes compare as un-equal when comparing their bits, and that nan numbers can sometimes compare as equal when comparing their bits; the former is exactly what we want, and the latter won't occur since the code path is triggered only for signed zero fill values.

>>> import numpy as np
>>> np.array(1e-323).view('V') == np.array(0.0).view('V'), 1e-323 == 0.0
(array(False), False)
>>> np.array(1e-324).view('V') == np.array(0.0).view('V'), 1e-324 == 0.0
(array(True), True)
>>> np.array(-1e-323).view('V') == np.array(-0.0).view('V'), -1e-323 == -0.0
(array(False), False)
>>> np.array(-1e-324).view('V') == np.array(-0.0).view('V'), -1e-324 == -0.0
(array(True), True)
>>> np.array(-0.0).view('V') == np.array(0.0).view('V'), 0.0 == -0.0
(array(False), True)
>>> np.inf * 0.0
nan
>>> np.array(np.nan).view('V') == np.array(np.nan).view('V'), np.nan == np.nan
(array(True), False)
>>> np.array(np.inf * 0.0).view('V') == np.array(np.nan).view('V'), np.inf * 0.0 == np.nan
(array(False), False)

d-v-b · 2025-08-01T13:43:26Z

nan numbers can sometimes compare as equal when comparing their bits

This is actually potentially super useful, because the zarr v3 spec distinguishes between different types of nans, even though numpy does not. In order to ensure that arrays round-trip correctly through zarr python, we need to generate exactly the specific nan defined in the metadata. I did a quick check and numpy will preserve the underlying byte representation of different nans, so this should be possible.

np.array([b'\x00\x00\x00\x00\x00\x00\xFF\xFF'], dtype='|V8').view('float').view('V')
array([b'\x00\x00\x00\x00\x00\x00\xFF\xFF'], dtype='|V8')

bojidar-bg · 2025-08-01T13:54:31Z

Oh, that's curious! Probably not something I can quite incorporate in the code here... unless we make all floating point arrays use bitwise comparison for empty chunks.. 🤔

bojidar-bg · 2025-08-01T15:20:49Z

That .view("V") trick fails on GPU with a ZeroDivisionError, presumably in cupy:_core/core.pyx:81 when v_is == dtype.itemsize == 0, as is the case for the "V" dtype...
Use structured np.void dtypes to achieve the same idea doesn't work because np.void isn't hashable, but it works when using a sized "V" dtype, like "V16". ~~Let's see if that's enough to get all tests green 😂~~ YES IT IS! 🎉 That GPU test was stubborn! 😂😂

codecov · 2025-08-01T15:30:35Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 94.54%. Comparing base (f087c56) to head (58f45b7).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #3216   +/-   ##
=======================================
  Coverage   94.54%   94.54%           
=======================================
  Files          78       78           
  Lines        9419     9423    +4     
=======================================
+ Hits         8905     8909    +4     
  Misses        514      514

Files with missing lines	Coverage Δ
src/zarr/core/buffer/core.py	`83.09% <100.00%> (+0.48%)`	⬆️

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

d-v-b · 2025-08-06T09:50:12Z

tests/test_array.py

+
+    # initialize the with the negated fill value (-0.0 for +0.0, +0.0 for -0.0)
+    arr[:] = -fill_value
+    assert arr.nchunks_initialized == arr.nchunks


this test is fine but ideally we would be testing the altered function explicitly, instead of indirectly via array creation + chunk writing. this is not a blocker for this PR, just something to sort out down the road

That test is basically copied from test_write_empty_chunks_behavior right above it. But yeah, it might be worth to have both a unit and an integration test in this case (:

d-v-b

thanks for this fix @bojidar-bg!

…ues and a zero fill value

github-actions bot added the needs release notes Automatically applied to PRs which haven't added release notes label Jul 8, 2025

d-v-b reviewed Jul 8, 2025

View reviewed changes

tests/test_array.py Outdated Show resolved Hide resolved

d-v-b reviewed Jul 8, 2025

View reviewed changes

tests/test_array.py Outdated Show resolved Hide resolved

d-v-b reviewed Jul 8, 2025

View reviewed changes

tests/test_array.py Outdated Show resolved Hide resolved

bojidar-bg force-pushed the 3144-negative-zero branch from d4c1205 to 2745b68 Compare July 8, 2025 11:48

github-actions bot removed the needs release notes Automatically applied to PRs which haven't added release notes label Jul 8, 2025

bojidar-bg force-pushed the 3144-negative-zero branch from 2745b68 to 7d6d74b Compare July 8, 2025 11:49

Write chunks with negative zero values and a zero fill value

c4904e3

Fixes zarr-developers#3144

bojidar-bg force-pushed the 3144-negative-zero branch from 7d6d74b to c4904e3 Compare July 8, 2025 11:53

d-v-b reviewed Jul 8, 2025

View reviewed changes

changes/3144.bugfix.rst Outdated Show resolved Hide resolved

bojidar-bg commented Jul 8, 2025

View reviewed changes

src/zarr/core/buffer/core.py Outdated Show resolved Hide resolved

d-v-b added 2 commits July 9, 2025 17:01

Update changes/3144.bugfix.rst

3f8c84f

Merge branch 'main' into 3144-negative-zero

1cc3250

dstansby added this to the 3.1.2 milestone Jul 31, 2025

Use bit patterns for comparing zeroes instead of signbit

683ec5f

Co-authored-by: Davis Bennett <davis.v.bennett@gmail.com>

fixup: Make sure fill value is a float before checking if it's == 0

c65774c

fixup: Make sure we don't copy arrays as we check for zeroes

919be15

bojidar-bg force-pushed the 3144-negative-zero branch from 0a596eb to 919be15 Compare August 1, 2025 14:02

Attempt using structured void dtypes for CuPy

dba8b0b

bojidar-bg force-pushed the 3144-negative-zero branch from f01d3c0 to dba8b0b Compare August 1, 2025 15:19

Merge branch 'main' into 3144-negative-zero

58f45b7

d-v-b reviewed Aug 6, 2025

View reviewed changes

d-v-b approved these changes Aug 6, 2025

View reviewed changes

d-v-b mentioned this pull request Aug 6, 2025

improve testing for buffer methods #3348

Open

d-v-b merged commit 1264a4d into zarr-developers:main Aug 6, 2025
31 checks passed

meeseeksmachine pushed a commit to meeseeksmachine/zarr-python that referenced this pull request Aug 6, 2025

Backport PR zarr-developers#3216: Write chunks with negative zero val…

4abf3ea

…ues and a zero fill value

meeseeksmachine mentioned this pull request Aug 6, 2025

Backport PR #3216 on branch 3.1.x (Write chunks with negative zero values and a zero fill value) #3349

Open

Uh oh!

Write chunks with negative zero values and a zero fill value #3216

Write chunks with negative zero values and a zero fill value #3216

Uh oh!

Conversation

bojidar-bg commented Jul 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bojidar-bg commented Jul 8, 2025

Uh oh!

Uh oh!

Uh oh!

d-v-b commented Jul 11, 2025

Uh oh!

dcherian commented Jul 11, 2025

Uh oh!

d-v-b commented Jul 11, 2025

Uh oh!

bojidar-bg commented Jul 11, 2025

Uh oh!

bojidar-bg commented Aug 1, 2025

Uh oh!

d-v-b commented Aug 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bojidar-bg commented Aug 1, 2025

Uh oh!

bojidar-bg commented Aug 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Aug 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

d-v-b Aug 6, 2025

Choose a reason for hiding this comment

Uh oh!

bojidar-bg Aug 6, 2025

Choose a reason for hiding this comment

Uh oh!

d-v-b left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

bojidar-bg commented Jul 8, 2025 •

edited

Loading

d-v-b commented Aug 1, 2025 •

edited

Loading

bojidar-bg commented Aug 1, 2025 •

edited

Loading

codecov bot commented Aug 1, 2025 •

edited

Loading